Policy Learning With Observational Data
نویسندگان
چکیده
In many areas, practitioners seek to use observational data learn a treatment assignment policy that satisfies application‐specific constraints, such as budget, fairness, simplicity, or other functional form constraints. For example, policies may be restricted take the of decision trees based on limited set easily observable individual characteristics. We propose new approach this problem motivated by theory semiparametrically efficient estimation. Our method can used optimize either binary treatments infinitesimal nudges continuous treatments, and leverage where causal effects are identified using variety strategies, including selection observables instrumental variables. Given doubly robust estimator effect assigning everyone treatment, we develop an algorithm for choosing whom treat, establish strong guarantees asymptotic utilitarian regret resulting policy.
منابع مشابه
Learning from observational data with prior knowledge
ion Domain layer Inference layer Task layer Strategy layer Figure 2: Degree of abstraction theoretical view When comparing the layers according to degree of abstraction, the above ordering suggests the idea of Fig. 2. However, the design of the strategy layer is, in practice, tightly bound with the problem domain, and can be viewed as more specific than the inference and task layers. Fig. 3 is ...
متن کاملLearning Optimal Policies from Observational Data
Choosing optimal (or at least better) policies is an important problem in domains from medicine to education to finance and many others. One approach to this problem is through controlled experiments/trials but controlled experiments are expensive. Hence it is important to choose the best policies on the basis of observational data. This presents two difficult challenges: (i) missing counterfac...
متن کاملInferring interventional predictions from observational learning data.
Previous research has shown that people are capable of deriving correct predictions for previously unseen actions from passive observations of causal systems (Waldmann & Hagmayer, 2005). However, these studies were limited, since learning data were presented as tabulated data only, which may have turned the task more into a reasoning rather than a learning task. In two experiments, we therefore...
متن کاملLearning Deterministic Causal Networks from Observational Data
Previous work suggests that humans find it difficult to learn the structure of causal systems given observational data alone. We show that structure learning is successful when the causal systems in question are consistent with people’s expectations that causal relationships are deterministic and that each pattern of observations has a single underlying cause. Our data are well explained by a B...
متن کاملLearning Policies for Data Imputation with Guided Policy Search
We explore the relationship between directed generative models and reinforcement learning by developing a new approach to data imputation that combines ideas from both areas. We address data imputation by defining an MDP for which we construct policies parametrized by (reasonably) large neural networks. We then show how to train these policies using a form of (self) Guided Policy Search (Levine...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Econometrica
سال: 2021
ISSN: ['0012-9682', '1468-0262']
DOI: https://doi.org/10.3982/ecta15732